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£h Abstract 

The increasing availability of large-scale data on human behavior has cat- 
alyzed simultaneous advances in network theory, capturing the scaling 

Oh 

properties of the interactions between a large number of individuals, and 
human dynamics, quantifying the temporal characteristics of human ac- 
q tivity patterns. These two areas remain disjoint, however, traditionally 

•l-H 

each pursuing as a separate modeling framework. Here we establish the 
first formal link between these two areas by showing that the exponents 
characterizing the degree and link weight distribution in social networks 
can be expressed in terms of the dynamical exponents characterizing hu- 
man activity patterns. We test the validity of these theoretical predictions 
on datasets capturing various facets of human interactions, from mobile 
calls to tweets. We find evidence of a universal measure, that links net- 
works and human dynamics, but whose value is independent of the means 
of communication, capturing a fundamental property of human activity. 
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Fueled by data collected by a wide range of high-throughput tools and technologies, 
the study of complex systems is currently reshaping a number of research fields, from cell 
biology to computer science. Nowhere are these advances more apparent than in the study 
of human dynamics and social media. Indeed, the unparalleled use of email, mobile devices 
and social networking have provided researchers access to massive amounts of data on the 
real time activity patterns of millions of individuals, simultaneously fueling advances in two 
research areas, network science [IHU and human dynamics [SHE]. Network science focuses 
on the structure and dynamics of complex networks that capture the totality of interactions 
between individuals, having led to the discovery of a series of generic properties of real 
networks, from the fat tailed nature of the degree distribution P, [10] to predictable patterns 
characterizing the weights or link strengths [TTHT3] . Human dynamics in contrast focuses on 
the temporal aspects of individual interaction patterns, offering evidence that the interevent 
times between consecutive events initiated by an individual follow a fat tailed distribution 
[SJ El EHZI] , representing a significant deviation from a Poisson process predicted by random 
communications. As network theory [Tj [21 HI E] and human dynamics [21 [33] have developed 
in parallel, being pursued as separate lines of inquiry, we lack relationships between the 
quantities explored by them, despite the fact that they often study the same systems and 
datasets. Here we derive a series of generic laws that link the quantities characterizing 
social networks and human dynamics, showing that the widely studied properties uncovered 
independently in the two areas represent two facets of the same underlying phenomena. 

To demonstrate the practical relevance of our results, we compiled four independent 
datasets that together capture most aspects of digital communication that humans are in- 
volved in lately (SI Section 1): 1) Mobile phone data, that summarizes the communication 
patterns of about 4 million anonymized European mobile users during a year period, pro- 
viding access to over 1.2 billion events, representing information on who talks with whom 
and the timing of each call [22]; 2) E-mail traffic within a university, that collects over 
two million email messages sent during an 83 day period exchanged by around 3,000 users 
[El [23]; 3) Twitter data, that records the tweets of about 0.7 million users, containing over 
8 million messages collected between Aug 2009 and Mar 2010 [21H2S]- 4) Online Messages, 
that records more than 500,000 messages sent by approximately 30,000 active users of a 
Swedish dating site over 492 days [HI [27] . 

Two widely studied quantities characterize the underlying social networks: 
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Degree distribution: The degree /^(ti,^) of an individual % represents the total number 
of individuals he/she contacted within the i 2 ] time interval, including both acquaintancy 
and strong ties [28, 29J. The degree distribution Pk(k) = A^ -1 X)i=i — &i) of each studied 
systems can be approximated with a power law PUniES] (Fig. []})). 

p k {k)~k-n, (1) 

where the degree exponent varies between j k = 1.0 for Twitter and 7^ = 4.8 for mobile 
phones (Table [T]). The measurements indicate that for Twitter, email, and online messages 
7fc is independent of time, but for mobile phones decreases from 7^ = 4.19 to 7^ = 3.20 
during a year (P k for different time intervals is shown in SI Section 7). 

Weight distribution: Denoting with w^j (weight) the number of contacts between two 
nodes [TTHT3"] . we measure the weight distribution P w (w) ~ J2i,j3( w ~ w i-*j) f° r different 
dataset (Fig. [lj:), finding that the weight distribution can be approximated with (Fig. ^p) 

P w {w)~w~^, (2) 

where the weight exponent varies between ^ w = 1.51 for mobile phones and 7^ = 1.9 for 
emails (Table [T]). 

To explore the dynamics of human activity, we focus on two frequently measured quan- 
tities HaQHEQl: 

Activity distribution: Denoting with Cj(ti, ^2) the activity, representing the total number 
of communications initiated by individual i within a [ti,^] time interval, we find that the 
activity distribution P C {C) = N' 1 6(C - Q) is fat tailed, following (Fig. [TJl) 

P C (C) ~ C" (1+/3c) , (3) 

where (3c ranges between 0.1 (Twitter) to 3.38 (mobile phones) (Fig. [T|i and Table [IJ Pc 
for different time intervals is shown in SI Section 7). 

Interevent time distribution: A key property of human dynamics is the non-Poissonian 
nature of the interevent time At between consecutive communication patterns [T4HT9~t 13~T] . 
Previous studies have found that PAt(At) ~ At _/3 ° exp(— At/Ai c ), with 0o — 1 (SI Section 
3.1 and Refs. [i~4TTi~9l f3Tj ) . As PAt(At) characterizes the communications between all friends, 
here we define a link-specific interevent time r^j as the total number of communication 
events initiated by user i between two consecutive communications from i to j (32]. For 
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example, ta->c = 3,4, 5 in Fig. [T^i. We measure the probability density function P T ,i( T ) f° r 
each individuals, finding they that all follow broad distributions (see SI Section 6). In Fig.[]J, 
we plot P t (t) = N~ 1 J2iCf 1 P T ,i(T/C i ) averaged over users with Cj > 100 (see Fig. S6 for 
P Ts i for different Cj activity groups), finding that it is also fat-tailed, well approximated by 
(Fig. 

P T (r) ~ T -^\ (4) 

where (3 T characterizes the inhomogeneity of the communication pattern for a pair of users. 
Queuing models predict (3 T = (3q = (fixed queue length) or 0.5 (variable queue length) 
[2HT9J [31]. We find, however, that the empirically observed f3 T varies between 0.2 (online 
messages) and 0.53 (mobile phones) (Table [T]), differences not accounted for by the existing 
theoretical approaches. 

In summary, the underlying social network is characterized by Pk{k) and P w (w), while 
the communication dynamics by P t (t) and Pc(C), each with its system dependent form. 
These two classes of phenomena, and the associated distributions, are treated independently 
in the literature [TJ I3T4T9] . Next we derive a set of relationships between the two phenomena 
and help uncover a universal quantity that appears to be independent of the means of 
communication used by an individual. 

While one expects that the more active is an individual (high Cj), the more friends 
he/she has (high fcj), as shown in Fig. ^p,e and Table [IJ the distributions Pk(k) and Pc(C) 
are not equivalent. To understand the relationship between ki and Cj, we measured for each 
individual how their degree (ki) grows with the number of communication events (Cj) they 
participated in. We find that the individual degree ki can be approximated with (Fig. [2l 



Htuh) ~ C i (t 1 ,t 2 ) ai , (5) 

where the exponent «j, which characterizes the individual's affinity to translate its level 
of activity into new contacts, varies from individual to individual. For each user at» < 1, 
indicating that the degree grows sub-linearly with the activity Cj, a phenomenon known as 
Heaps' law in linguistics [33] . This means that increasing the number of calls has diminishing 
impact on the growth in the number of friends. While the temporal patterns of both ki and 
Cj might be affected by environmental factors and circadian rhythms, we find that Eq. ^ 
is independent of the observational time frame. 



The fact that the exponent a, varies from individual to individual indicates that users with 
similar activity levels acquire degrees at different rates (Fig. [2^). Therefore, a* characterizes 
an individual's ability to add friends given his/her activity level Cj, prompting us to call 
a; sociability. To investigate the demographic variation of sociability, in Fig. we show 
the sociability distribution for all four datasets, finding that P a {a) = N^ 1 — on) is 

bounded between and 1 and decays rapidly on both sides of the peak. We also find that 
a; is largely independent of C$, as indicated by the conditional probability P a (a\C), that 
overlaps for users with different activity C (Fig. [2Jd, inset). Somewhat surprisingly, this 
indicates that sociability, i.e., the ability to establish new contacts, is independent of the 
individual's activity level, representing instead an intrinsic property of an individual. Figure 
[2Jd shows P a (ct) for all datasets, indicating that each communication system is characterized 
by its own distinct P a {a) and average sociability a (see Table [[]& inset of Fig. g). 

Heaps' law ^ allows us to link a node's degree, a network measure, to the bursty patterns 
of human activity. Indeed, let us denote with n i (t 1 ,t 2 ) the probability that individual i 
contacts a new friend j, i.e. someone that i did not contact in the previous 1 2 ] time 
frame. This requires that the waiting time t^j, that characterizes the communication 
between i and j, be greater than Ci(ti, £2)- The probability that the interevent time exceeds 
Ci(t u t 2 ) is lUfah) = Set PrA T ) dr ~ C^ T >\ Yet > (@) indicates that 11* = dh/dQ ~ Cf*" 1 . 
Comparing these two equations, we obtain the scaling relationship 

on + p T ,i = 1- (6) 

Equation ^ indicates that the bursty nature of human activity patterns (/3 T ,i) determines 
the growth of individual degree in the social network via k{ ~ C\ ^ T '* , predicting the observed 
sublinear growth. Therefore, Eq. ^ represents the first bridge between human dynamics 
(e.g. Ti^j and (3 Tii ) and the structure of the underlying social network (e.g. fc, and otj). 
Equation ^ offers a rather strong prediction, indicating that a simple scaling relationship 
holds for each individual, despite the individual variability in a* and /3 T j. For validation we 
need to determine a* and (3 Tti for each individual, which requires sufficient individual statis- 
tics. However, as sociability on is largely independent of individual activity level (inset of 
Fig. |2Jd) , we can focus on active users, for which we have sufficient data, without introducing 
a selection bias. In Fig. [2^, we measure the sociability ot{ and the waiting time exponent 
/3 T i independently for users with C* > 1000 and ki > 100 in the mobile phone dataset (see 



SI Section 6), finding that while a« and (3 T ^ vary considerably between users, P(oti + (3 Tt i) 
is narrowly peaked around 1, offering strong support for Eq. ([6]). As shown in Table [IJ the 
prediction ^ is consistent not only with the exponents measured in each dataset, but also 
with the queuing models p~U EE] [31] (for sake of simplicity, the average a and j5 T are re- 
ported). Perhaps most surprisingly, we find that when rescaled with the average (3 T , Pp T (f3 T ) 
for the different datasets collapse into a single curve (Fig. |2]i) 

Pp T {p T ) = {\ffc)F{p T fft), (7) 

suggesting that the distribution Pp T (f3 T ) of the bursty exponent (3 T captures an inherent 
property of the population, independent of the means of communication. This data collapse 
is quite remarkable, given the difference in the nature of the data (calls, emails, tweets, 
and online messages), timeframes, countries and demographics (phone: about 25% of an 
European country's population [22]; emails: university employees from a different European 
country [Hl|23]; Twitter: mainly US [2U|25]; Online Messages: Swedish teenagers [6| 127]). 
Figure |2]i suggests an exponential growth of F(x) for small x, i.e., F(x) ~ exp(crx), where 
a ~ 6.6 appears to be the same for all datasets (Table [T]), a parameter that will play an 
important role below. 

The scaling law (|5]), together with the sociability distribution P a (a) allows us to derive an 
another relationship between social networks and human dynamics. Indeed, the statistical 
independence between a and C implies 

p k (k) = J 5{k- C a )P a {a)P c {C)dadC, (8) 

indicating that the fat tailed nature of the degree distribution is rooted in the population 
heterogeneity in terms of sociability on and activity Cj. Note that this relationship is inde- 
pendent of the particular form of Pk{k) and Pc(C), being equally valid if they follow power 
laws, stretched exponentials or log-normal distributions. We compared the empirically mea- 
sured Pk with the prediction (|8p for all datasets, obtaining excellent agreement (Fig. [3^-d). 
Therefore, Eq. ^ is the second bridge between human dynamics and the network structure, 
capturing the competition between two phenomena: 

Activity Driven: If Pk{k) is dominated by differences in the users' activity level (the activ- 
ity distribution Pc(C)), we can ignore the variations in P a , replacing individual sociability 
(pi) with a, finding 

P k (k) ~ k^Pcik 1 ^). (9) 
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This limit correctly describes email, twitter, and online messages (Fig. [3^l-c). 

Sociability Driven: If P a (a) dominates, the individuals' activity level (Ci) can be ap- 
proximated with their mean C, and Eq. ^ predicts that the sociability distribution has an 
exponential tail P a (a) ~ exp(— aa/(3 T )) (shaded area in Fig. ^p) that dominates the scaling 
of (j8j, obtaining 

P k (k)~k-( 1+,7 ^c))^ (10) 

indicating P^ has a power law tail, whose exponent 7& is determined by variability in socia- 
bility, captured by the parameter a. More interestingly, it predicts that 7*; decreases with 
the average activity level C, leading to a scaling exponent that depends on an extensive 
quantity, not observed before in network science. Indeed, as C increases with the obser- 



vation time (Fig. S2), (10) predicts a time-dependent 7^, driven by changes in C. Figure 
3^-f show that despite the temporal stationarity of individual activity (Fig. S8A) for mobile 
communications, 7^ decreases with C for different time interval [^1,^2], indicating that the 
degree heterogeneity of mobile phone users is indeed driven by variability in their sociability. 

Combining ^ with these two classes, we predict the degree exponent in (Jl|, as (SI 
Section 4) 



7^ = 1 + min 



(11) 



l-/V/3 T lnC 

In Table [I] we report the 7c and 7^ of the power law model for all datasets. Yet, Eqs. ([8 10) 
are not limited to power laws; other fat tailed models for Pq such as lognormal or stretched 
exponential can also be exploited, as discussed in SI Section 4. The fundamental relationship 
(Jsj) and the distinction between the two classes is therefore independent of particular models 
(and fits) for Pq- 

To derive the network's weight distribution P w (w) we note that for each individual i, 
J2jWi^j = Ci, where w^j denotes the total number of messages/calls from i to j. We 
denote with p r = p^j = w^j/Ci the probability that user i communicates with user j, and 
r is the rank of Pi^j across all friends j of user i. We find that p r is well approximated by 
Zipf's law p r ~ r (Fig. [34], a direct consequence of the fat tailed nature of P w (w) 
[T1TH3] . That is, an individual communicates most of the time with only a few individuals 
and it interacts with the rest of its contacts with diminished frequencies. Intuitively, one 
would assume ( is the same for individuals with the same activity C. Yet, we find that 
for three randomly selected users, each with the same activity Ci = 400, p r has different 



Q exponents (Fig. |4p,). However, for users with different activities but the same sociability 
a, the curves are indistinguishable (Fig. |4^i), hinting the existence of a link between Q and 
a*. This relationship can be derived by focusing on an individual's least preferred contact. 
Intuitively, there are only a few communications (0(1)) between the individual i and his/her 
least preferred contact, independent of the activity level Cj. Therefore, given ki, the total 
number of contacts of individual i is Cip^ = Cik^ 1 = Cl~ a ^ 1 = 0(1), obtaining [35j 136] . 

cad = 1. (12) 

The validity of (12) is confirmed in Fig. where we show that p T {r l l a ) collapses for users 
with different ctj and the curve has the slope —1 for the top ranked contacts, as predicted 



by (12). Equation (12) allows us to derive the weight distribution P w (w). The weight 



distribution P w {w) is averaged over populations, as 



P w (w) = fj25(w- A(C, a)Cr~ 1/a )P c (C)P a (a)dCda } (13) 

where the normalization factor A(C,a) = AJ2r=i r~ x l a with a system-dependent constant 
A corresponding to the average weight. Figure |4p-f confirms the validity of Eq. (13) for all 



datasets. The fact that Zipf's law is equivalent with P W i(w) ~ w ( 1+1 /Ci) = w ( 1+ai \ where 
P w ,i(w) represent the weight distribution of individual i, leads to a first order approximation 



of Eq. (13) as P w (w) ~ w lw with cutoff, where the exponent 7^ = 1 + a = 2 — f5 T up to 



the leading order. Combining this with ([6]), we find 

lw = 2-J T . (14) 

The prediction (14), supported by the empirical data in Table [TJ represents the third link we 
uncover between a temporal property of human dynamics (/3 T ) and a structural property of 
social networks (j w ). 

In summary, Eqs. Q, ([6]), (J8]), (12) and (13) offer five direct links between human 



dynamics and the architecture of social networks, showing that the degree distribution (Pj.) 
and the tie strength distribution (P w ) can be expressed in terms of the dynamical exponents 
characterizing the temporal patterns in human activity, like burstiness (P r ,i) and the activity 
level (Pc). The relationship between these two classes of exponents is mediated by the 
parameter a, which we find to be independent of the communication technology, hence 
capturing an inherent property of human activity. These relationships bring an unexpected 



order to the zoo of exponents reported in Table [TJ showing that they represent different 
facets of a deeper underlying reality. 

A better understanding of the origin of the exponents reported in Table |T] requires mecha- 
nistic models, tailored to the specific communication phenomena. Yet, the relationships 
(pi), ([8]), (12), and (13) derived here are independent of the system's details or the specific 
communication mechanism, thus all future models that aim to account for human dynam- 
ics and social networks in a specific system must obey them. Hence, the theory developed 
here represents the skeleton of a joint mathematical theory of human dynamics and social 
networks. As our understanding of human dynamics deepens with the emergence of new 
and increasingly detailed data on both human activity patterns and social networks, such 
fundamental relationships are expected to have increasing value, helping us anchor future 
models and offer a springboard towards a deeper mechanistic understanding of big data, the 
often noisy, incomplete, but massive datasets that trail human behavior. 
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[1] G. Caldarelli, Scale-Free Networks (Oxford Finance, 2007). 

[2] R. Cohen and S. Havlin, Complex networks: structure, robustness and function (Cambridge 
Univ Pr, 2010). 

[3] M. Newman, Networks: an introduction (Oxford University Press, 2010). 
[4] S. N. Dorogovtsev and J. F. F. Mendes, Evolution of Networks (Oxford University Press, 
2003). 

[5] C. Castellano, S. Fortunato, and V. Loreto, Reviews of Modern Physics 81, 591 (2009). 

[6] D. Rybski, S. Buldyrev, S. Havlin, F. Liljeros, and H. Makse, Proceedings of the National 

Academy of Sciences 106, 12640 (2009). 
[7] D. Lazer et al., |Science 323, 721 (2009)1 



[8] D. Brockmann, L. Hufnagel, and T. Geisel, Nature 439, 462 (2006). 
[9] R. Albert and A.-L. Barabasi, Reviews of Modern Physics 74, 47 (2002). 



[10] A.-L. Barabasi and R. Albert, Science 286, 509 (1999). 

[11] K.-I. Goh, B. Kahng, and D. Kim, Physical Review Letters 87, 278701 (2001). 

[12] A. Barrat, M. Barthelemy, R. Pastor-Satorras, and A. Vespignani, Proceedings of the National 

Academy of Sciences 101, 3747 (2004). 
[13] A. Barrat, M. Barthelemy, and A. Vespignani, Physical Review Letters 92, 228701 (2004). 
[14] A.-L. Barabasi, Nature 435, 207 (2005). 
[15] J. Oliveira and A.-L. Barabasi, Nature 437, 1251 (2005). 
[16] A. Vazquez, Physical Review Letters 95, 248701 (2005). 

[17] A. Vazquez, J. Oliveira, Z. Dezso, K.-I. Goh, I. Kondor, and A.-L. Barabasi, Physical Review 

E 73, 036127 (2006). 
[18] A. Gabrielli and G. Caldarelli, Physical Review Letters 98, 208701 (2007). 
[19] A. Gabrielli and G. Caldarelli, Physical Review E 79, 041133 (2009). 
[20] R. Malmgren, D. Stouffer, A. Campanharo, and L. Amaral, Science 325, 1696 (2009). 
[21] J. L. Iribarren and E. Moro, Social Networks 33, 134 (2011). 

[22] J. Onnela, J. Saramaki, J. Hyvonen, G. Szabo, D. Lazer, K. Kaski, J. Kertesz, and A.-L. 

Barabasi, Proceedings of the National Academy of Sciences 104, 7332 (2007). 
[23] J. Eckmann, E. Moses, and D. Sergi, Proceedings of the National Academy of Sciences 101, 

14333 (2004). 

[24] M. Cha, H. Haddadi, F. Benevenuto, and K. Gummadi, in 4th International AAAI Conference 

on Weblogs and Social Media (ICWSM) (2010). 
[25] A. Mislove, S. Lehmann, Y. Ahn, J. Onnela, and J. Rosenquist, in 5th International AAAI 

Conference on Weblogs and Social Media (ICWSM) (2011). 
[26] S. Golder and M. Macy, Science 333, 1878 (2011). 

[27] P. Holme, C. R. Edling, and F. Liljeros, Social Networks 26, 155 (2004). 
[28] R. Dunbar, Journal of Human Evolution 22, 469 (1992). 
[29] B. Gongalves, N. Perra, and A. Vespignani, PLoS One 6, e22656 (2011). 
[30] J. L. Iribarren and E. Moro, Phys. Rev. Lett. 103, 038702 (2009)] 
[31] A. Cobham, Journal of the Operations Research Society of America 2, 70 (1954). 
[32] M. Karsai, K. Kaski, A. Barabasi, and J. Kertesz, Scientific Reports 2, 397 (2012). 
[33] H. Heaps, Information retrieval: Computational and theoretical aspects (Academic Press, Inc. 
Orlando, FL, USA, 1978). 

10 



[34] B. Corominas-Murtra, J. Fortuny, and R. V. Sole, Physical Review E 83, 32767 (2011). 
[35] R. Baeza- Yates and G. Navarro, Journal of the American Society for Information Science 51, 
69 (2000). 

[36] L. Lu, Z. Zhang, and T. Zhou, PloS one 5, el4139 (2010). 
FIGURES 



11 



a (User A's contact history) 



c 


F 


B 


C 


E 


D 


F 


C 


B 


B 


E 


D 


C 


E 




A*C 



A->C 



A*C 





10" r 


b 






ID" 1 




10 




10 






<c 


io ; 




10 




10 




io 7 - 




10 



♦ Mobile phone 

• Email 
Twitter 

■ Online Message 



G 10 




10 
10 
10"' 
10 : 

io- 
ur 
i<f 
io - ' 



■V *v 



,1- *♦. 

".V ( 




FIG. 1. Basic measures characterizing networks and human dynamics, (a) The definition 
of t^-s-C) the interevent time captures communication intervals between two individuals, A and C. 
Note that ta->c measures time in terms of the number of events, a feature that corrects for 
daily fluctuations in the communication volume, but has the same asymptotic scaling as the real 
interevent time p3j. (b) Degree distribution -Pfc(fc), and (c) link weight distribution P w {w) for 
each of the four studied datasets. (d) Activity distribution Pc(C). (e) The distribution of the 
number events between consecutive communications with the same individual, P t (t), where r is 
normalized by each individual's activity level C. 
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FIG. 2. Measuring user sociability, (a) The growth in degree ki(t\,t2) for ten mobile phone 
users in function of the same user's activity Ci(ti,t2), where each dot corresponds (Cj, /cj) for one 
time frame [ii,t2]- Similar curves are observed for the other datasets (see Fig. S3), (b) The 
sociability distribution, P a (a), for the three studied datasets, where the shaded region highlights 
the tail of P a (a). Inset: conditional probability distribution P a (a\C) for mobile phone users with 
activity C = 200, 300 and 800, respectively, (c) P(oti +P T ,i) (green curve) measured for the mobile 
phone dataset for users with C{ > 1000 and h L > 100, supporting the validity of Eq. Q. The 
purple and blue curves correspond P a and Pg T for the same user group, respectively, (d) The 
collapse of Pp T {j3 T ) distributions after rescaling Pp T (f3 T ) with average /3 T for each datasets. The 
black line represents a Burr type II distribution, F(x) oc exp(ax)/ (1 + sexp(nx)) with a = 6.6, 
capturing the exponential growth F(x) ~ exp(6.6:r) for small f3 T . 
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FIG. 3. Predicting the Degree Distribution The measured degree distribution Pk(k) (solid), 
compared to the predictions of Eqs. ([8]) (open) and ([9J (cross) for (a) Email (b) Twitter (c) Online 
Message and (d) Mobile Phone datasets, respectively, showing that Eq. Q is consistent with 
empirical observation. For emails we adjusted Q to allow for multiple recipients (see SI Section 
2). The validation of Eq. ^ for Email, Twitter and Online Message datasets also indicates 
these systems are activity driven, (e) Pk(k) for mobile phone dataset, revealing power law tails 
for different time frames AT = t% — tx, from 1 month to 1 year (see SI Section 7 for all datasets). 



(f) The degree exponent jk deceases with average activity C as predicted by (10), indicating that 
mobile phone communication is sociability driven. 
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FIG. 4. Quantifying the tie strength distribution, (a) Zipf 's plot showing the communication 
frequency p r j for a user i with the user's r-th most contacted friend for the mobile phone data. The 
different colors and symbols represent different activities and sociabilities, respectively, indicating 
that the Zipf 's exponent Q depends only on the sociability ctj. (b) The plot of p r versus r 1/,Q showing 



collapses over different sociability groups, as predicted by aid = 1, derived in (12). Similar plots 



are observed for the other datasets (see SI Section 3.3). (c,d,e,f) The degree distribution P w (w) 



from empirically measurements (solid), comparing to the predictions of Eq. (13) for (c) Mobile 
Phone (d) Email (e) Twitter and (f) Online Message datasets, respectively, showing that Eq. (13) 
is consistent with the empirical observation. 

TABLES 
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TABLE I. Quantify networks and human dynamics. The scaling exponents characterizing 
the networks and human dynamics in the four studied datasets, as well as the most studied human 
dynamics models. The reported a and /3 T represent average values over the population for empirical 
data, where f3 T is measured from P t (t) ~ t~( 1+ ^ t ) as a first order approximation. The error of j3 T 
and Pc are derived from the error of 1 + /3 T and 1 + fie > respectively. Note that the small error bars 
of exponents are due to the large population size. See SI Section 6 for justification of the goodness 
of fit. 





Mobile phone 


Email 


Twitter 


Message 


Queuein; 


g Models 












Fixed Length [31] 


Variable Length 1 141 


Ik 


4.19 ± o.oi -T- 3.205 ±0 .oo7 


2-27±o.oi 


1.241± .ooi 


1.624 ±0 .003 






Jw 


1.51335^0.00006 


1.637±o.oo3 


1.8483±o.ooo6 


1.930±o.0O2 






Tt 


0.53823±o.ooooi 


0.431±o.oo2 


0.3162-to.oooi 


0.360±o.oo2 





0.5 


Pc 


3.39-to.oi 


0.82-to.oi 


0.147 ±0 .ooi 


0.430±o.oo2 






a 


0.58 ±0 .oi 


0.68±o.02 


0-78±o.oi 


0.70 ±0 .oi 


1.0 


0.5 


a 


6.6 ± o.i 


6.8±o.2 


6.6 ± o.i 


6.6 ± o.i 






InC 


3.4 5.9 


4.8 


5.4 


3.0 
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